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The zero-mass (ZM) parton formalism is widely used in high-energy physics because 
of its simplicity and historical importance, even while massive quarks (c, 6, t) are playing 
an increasingly prominent role in particle phenomenology, including global QCD analyses 
of parton distributions based on the more precise general-mass (GM) QCD formalism. In 
view of this dichotomy, we show how the obvious inconsistencies of the conventional im- 
plementation of the ZM formalism can be corrected, while preserving the simplicity of its 
hard matrix elements. The resulting intermediate-mass (IM) scheme for perturbative QCD 
calculation can be considered either as improved ZM formulation with realistic treatment 
of heavy-flavor kinematics; or as a simplified GM formulation with approximate ZM hard 
cross sections. Phenomenologically, global analyses based on IM calculations can effectively 
reproduce, within the present estimated uncertainty bands, the more correct GM results on 
parton distributions, as well as their predictions for a wide range of collider processes of 
current interest. 
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1 Introduction 



Global QCD analysis is based on factorization theorems of perturbative quantum chromody- 
namics (PQCD) at high energies. It allows the determination of universal parton distribution 
functions (PDFs) by comparing the QCD parton formulas with a wide range of available hard 
scattering data (which involve at least one large energy scale, generically referred to as Q). 
The conventional derivation of the factorization theorems in PQCD is formulated in the 
zero-quark- mass hmit and is valid to 0{Aq(^jj/Q'^) [1]. Since QCD interactions depend on 
the number of active quark flavors, and this number (denoted hy Uf) varies in practice, it is 
necessary to vary Uf in the theoretical calculations according to the effective energy scale 
of the interaction — it is incremented by 1 every time ^ crosses one of the mass thresholds for 
heavy quarks [2]. In the common implementation of this so-called variable-flavor-number- 
scheme (VFNS), the changes of n/ with the factorization scale across heavy-flavor thresholds 
are taken into account in the evolution of PDFs and in the summation over active parton 
flavors in the factorization formula, but heavy-quark mass effects are ignored in the hard 
matrix elements, as well as in the evaluation of the phase space of final states in the convolu- 
tion integral. This provides a simple and convenient working platform for most calculations 
in high-energy phenomenology; and it has been widely used both in practical calculations 
and in global QCD analysis of PDFs. We shall refer to this calculational scheme as the 
conventional zero-mass (ZM) VFNS. 

As the PQCD theory advanced, and the proper treatment of heavy quarks became 
more important, it was recognized that the validity of factorization ought to be extendable 
to PQCD with non-zero quark masses [3,4], i.e. to be vahd to order A.qcd/Q'^ uniformly, 
independent of the ratios rrih/Q of quark masses nih for h — c, b, and t. This was formally 
proved for inclusive deep inelastic scattering in Ref. [5]. This generalized factorization, which 
does not assume mj/Q — > for all parton flavors i as in the ZM formalism, provides the basis 
for an improved global QCD analysis formulated in the general-mass variable-flavor-number 
scheme (CM VFNS). Recent CM VFNS global analyses [6-8] demonstrate that the proper 
treatment of heavy-quark mass terms is essential for reliable predictions of cross sections at 
the Tevatron and LHC. This is because a good fraction of input precision data from DIS and 
flxed-target experiments included in the global analysis are at energy scales comparable to, 
or not too far above, the charm and bottom masses [9] ; hence the PDFs are sensitive to the 
more precise treatment of mass effects in the CM scheme. 

Nonetheless, because the ZM PQCD formahsm is much better known, and the requisite 
hard cross sections (Wilson coefficients) are much more readily available, the ZM VFNS 
remains ubiquitously used in particle physics applications, as well as in many contemporary 
global analyses of parton distributions [10-12]. With this in mind, it is useful to reexamine 
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the ZM calculations from a new perspective, to see whether it is possible to preserve its 
main simplifying feature, massless hard-scattering cross sectionsjl] while correcting its other 
obvious inconsistencies. These inconsistencies, largely overlooked before, were recognized 
in [6]; the possibility of formulating improved implementation of the ZM scheme was raised 
in [9]. In this paper, we address this problem in detail. We are motivated by the observation 
that the differences between the GM and ZM PDFs that caused the observed shifts in the 
LHC W and Z cross section predictions arise mainly due to the kinematical suppression 
of heavy-flavor scattering cross sections near their respective mass thresholds in the global 
analysis. Inclusion of the correct kinematic treatment in the ZM global analysis can bring 
the resulting PDF's much closer to the GM analysis, while retaining the simplicity of the 
massless hard cross sections in the ZM scheme, if this is done with due care. We focus 
on the next-to-leading order (NLO) global analysis, which benefits most directly from such 
improvements. 

In Section [2|, we analyze the essential components of the variable flavor number scheme 
and identify those parts of the conventional ZM implementation that conflict with kinematic 
aspects of heavy- flavor production. Based on this analysis, in Section [3l we propose practical 
methods to correct these problems, leading to a class of "intermediate-mass" (IM) schemes 
that potentially can serve as effective approximations to the GM scheme. Numerical compar- 
isons of ZM/IM/GM calculations are presented. In Section HI we perform parallel global fits 
based on the ZM/IM/GM schemes, compare the effectiveness of the proposed IM schemes, 
and compare the PDFs and theoretical predictions for representative collider cross sections. 
Alternative choices of the rescaling variable in the GM scheme are considered in Section 14.41 
Concluding remarks are given in Section [5l 



2 Conventional ZM VFNS 

The general PQCD factorization for high-energy hard processes, exemplified by the inclusive 
DIS structure functions -F\(x,(5^), has the form 

F,{x,Q') = 5^ /.(e,/i)C,",Af7,-,-,«.(/i)') 

= J2[fa^Cl,]{x,Q,m,,^). (1) 

a,b 

Here /a(^, /i) is the parton distribution function for an initial-state parton a with momentum 
fraction ^ at the factorization scale /i, C^ ^^ are hard cross sections for the scattering of parton 

^It should be noted that ZM and GM hard cross sections are practically indistinguishable if the typical 
energy is much larger than rric and mf,, as is the case in most collider phenomenology. In this high-energy 
regime, ZM hard cross sections are compatible with the PDFs obtained by global analyses in either mass 
scheme. What we are concerned here are the differences between the ZM and GM calculations at low Q that 
affect the global analysis of the precise DIS data from HERA at small x. The resulting differences in PDFs 
can affect predictions of physical observables at both low and high energies. 
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a into a final-state parton b, and rrii collectively represents the quark masses. The perturba- 
tively calculable hard cross sections C^^^ are also variously known as Wilson coefficients, hard 
matrix elements, or, least formally, matrix elements. In VFNS, initial states a are summed 
over the active parton flavors at the factorization scale fi, typically chosen to be = Q 
(and equated here to the renormalization scale). The lower limit of the convolution integral 
X is a function of the DIS kinematic variables (x, Q) determined by flnal-state phase-space 
constraints in each parton-level scattering subprocess. 

In its general form, the factorization formula ([T]) applies both to the GM and ZM 
VFNS. With the insight gained from the modern formulation of the GM VFNS, we will now 
dissect this formula in the conventional implementation of the ZM formalism and expose the 
elements that are usually taken for granted together. (Since we always work with the VFNS 
in this paper, we shall omit the VFNS designation for both GM and ZM cases from now on.) 

(i) ZM hard matrix elements. Central to the ZM formalism, the hard matrix elements 
in Eq. ([1]) are taken to be the standard ZM ones, C^;^ = C^^^ ( |, — , 0, ^^(/x) ) : they are 

' \« ^' /MS 

calculated under the assumption that all active partons are massless, with the associated 
singularities subtracted in the MS scheme. Their expressions are well-known — to NNLO in 
the QCD coupling as{fi) for DIS and Drell-Yan processes [13], and to NLO for many other 
processes. 

For the DIS process, the hard matrix elements are also known for non-zero quark masses 
to order [14], but the zero-mass formulas are much simpler and easier to use. Hence the ZM 
calculation is commonly used because of its simplicity and convenience. On the other hand, 
for almost all other physical processes, such as Drell-Yan pair, W/Z boson, and inclusive jet 
production, the GM hard matrix elements are not available beyond the leading order. Thus, 
by necessity, the ZM matrix elements are still widely used both in physical applications and 
in global analyses in general. 

(ii) Final-state counting. The summation of the final-state parton fiavors in Eq. ([1]), ^j,, 
is conventionally taken to be over all active parton fiavors at scale /i, the same as for the 
initial-state summation 

This convention originated from the strict zero-mass parton formalism of the 1970's and 
80's, but is clearly problematic from the modern perspective. As emphasized in [6,9], there 
is an important conceptual difference between the initial- and final-state summations 
and Ylb- Xla runs over the active parton fiavors, a theoretical concept dependent on the 
choice of the renormalization scheme for a^, the factorization scheme for the PDFs, and, 
of course, the value of fi. On the other hand, Ylb involves summation over physical final 
states that are kinematically allowed at the given scattering energy. Determined by physical 
kinematical considerations, Ylb should be independent of the choice of the renormalization 
and factorization schemes and the scale variable /x. 

This dichotomy leads to unintended inconsistencies for the conventional ZM calculation. 
For example, consider a small-x kinematic configuration common at HERA, say x = 10~^ 
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and Q = 3 GeV, corresponding to virtual Compton scattering center-of-mass (CM) energy 
W = Q y/x'^ — 1 = 300 GeV. Because the factorization scale fj, = Q = 3 GeV is smaller 
than the bottom quark mass mb, the bottom quark b in this calculational scheme is not 
counted as an active parton; thus the final-state parton fiavor summation does not include 
b. However, in reahty bottom quarks are easily produced at this CM energy W, as is indeed 
experimentally observed. The problem is even more pronounced for Q around and below the 
charm mass rric. 

(iii) Phase-space treatment. In ZM calculations, the convolution integral y in Eq. ([1]) 
is usually computed using massless parton kinematics, so that x in the lower integration limit 
is equated to the Bjorken x = Q^/(2g ■ p). For the leading-order quark scattering process 
V*a b (where V* stands for the virtual vector boson '~f*/W/Z), this leads to the well- 
known result Fx{x,Q^) = '^ab^b,\ fai^^Q)^ where C^;^ are the appropriate (electroweak) 
coupling parameters. 

Because this integral originates from summing over final-state phase space of real par- 
ticles, this practice leads to violation of Lorentz kinematics in the case of heavy-fiavor pro- 
duction. For instance, in neutral-current DIS at Q > nib and W < 2mb (say, x ~ 0.3), this 
calculational scheme will predict similar contributions from b quark production and d sea 
quark production (since ZM hard matrix elements are fiavor- independent), whereas, in fact, 
this kinematical regime is below the 6-production threshold, and the b and d scattering cross 
sections are completely different @ 

Past global QCD analyses carried out in the ZM scheme have overlooked these unphysical 
features, since the heavy-quark contributions to fiavor- inclusive cross sections are relatively 
small. The resulting deviations from the true behavior are compensated in the global analysis 
process by the fitting process, leading to very good agreement with the examined data. 
Thus, many ZM PDFs (including CTEQ6 [10] and CTEQ6.1 [11]) have been widely used in 
comparisons of current experimental measurements to theory, and in making predictions for 
new processes in collider phenomenology^! 

Having identified the unphysical features of the conventional implementation of the ZM 
scheme in this section, we can now investigate whether these elements can be corrected, while 
preserving its simple matrix elements, to produce a simple and effective approximation to 
the GM theory. This is worthwhile, since the ZM formalism is very versatile and practicable; 
and it is still used in the vast majority of applications. 



^Additional kinematical effects may be relevant in semiinclusive observables, e.g., due to the heavy-quark 
masses in resumnied multi-scale differential distributions [15] or final-state hadron masses in heavy-quark 
fragmentation [16]. 

^Parallel PDF sets in the GM VPNS, such as CTEQ5HQ/CTEQ6HQ [17,18], were also provided along 
with CTEQ6 and CTEQ6.1. They have not been used as widely in general applications, but served as 
standard PDPs in the analysis of heavy-quark production. 
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3 Intermediate mass VFNS 



From the presentation of the last section, we can see that the particulars of counting final 
states and of treating final state phase space in the conventional implementation of the ZM 
scheme (points (ii) and (iii) of the previous section) are not intrinsic to the ZM treatment of 
hard matrix elements (point (i)). With the insight gained from the GM formulation of [6], 
we can try to eliminate these kinematic contradictions on inclusive cross section calculation 
from the ZM scheme. There is no unique way to do this, even though the underlying physics 
provides valuable guidance on how to proceed. We shall now discuss the possibilities. 

3.1 Satisfying heavy-flavor production kinematics 

First, consider the issue of phase space integration for a given partonic subprocess con- 
tributing to the right-hand side of Eq. ([1]), say the LO V* + q —>■ h, where q denotes either a 
light or heavy quark, and h a heavy quark (c, b, t) . In the conventional implementation of ZM, 
for zero-mass kinematics, one gets the contribution Fa(x, Q^) = ^ fq{x,Q), which runs 
into serious problems if W is near or below the production threshold for the heavy quark. It 
has been known since early days of charged- current DIS [19], and, more recently, from the 
GM approach [6, 9, 20] that the phase space constraint due to heavy-quark masses can be 
naturally implemented by replacing the Bjorken x above by a resettling varittble "ACOT-x" , 

X = x{l + Mf/Q'), (2) 

where Mf denotes the total mass of the final state {rrih for charged- current DIS, 2mh for 
neutral-current DIS), i.e. F\{x,Q'^) = C^'^^^ fq {x{x,Q) ,Q). One can easily verify that, as 
W approaches the heavy-quark production threshold Mf from above, x 1; thus Fx 
in the heavy-quark production channel, as it shouldB 

This problem is general, applicable also to NLO and beyond: by extending the lower 
bound of the convolution integral to Bjorken x in Eq. ([1]), the conventional ZM formalism 
grossly overestimates the contributions from the region of phase space near (and even be- 
yond) the physical thresholds for all ehannels. A straightforward way to correct this problem 
is to replace the lower limits of the convolution integral with the equivalent rescaling variable 
x{x, Q) for all terms. This would restrict the phase space to the physically allowed region, 
and, importantly, also ensure consistency between the LO and higher-order terms with re- 
spect to a shift of the factorization scale. However, this prescription is not unique; and it 
needs some careful consideration. For instance, the rescaling prescription Eq. ([2]) involves an 
upward shift of the variable a; by a constant factor (l + Mj/Q"^), even though the physical 
mass-threshold kinematic constraint is mainly a large- a; (low-H^) issue for a given Q. Since 
parton distributions are rapidly varying functions at small x, the suppression of phase space 

"^This rescaling is more than just a convenient prescription. Its emergence can also be seen in the origin 
of the LO term as the resummation of coUinear singularities of higher order terms in the limit Q ^ Mj: if 
mass effects are kept, this variable will appear naturally in the collinear terms, cf. [20]. 
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due to the substitution x — x could unnecessarily suppress the contribution from the gluon 
fusion term in the small-x region, causing potential disagreement with the accurate DIS data 
from HERA. 

It is therefore desirable to generalize the rescaling variable x, so that it enforces the 
mass-threshold constraint at large x in the same natural way, but smoothly recovers the 
standard scaling x variable away from the mass threshold {i.e., at small x) in a controllable 
way. We introduce a new rescaling variable ( using the relation, modeled after Eq.([2]): 

x = C{l+eMpQ')-\ (3) 

where the parameter A is a positive number. The variable ( has the following properties: 

a) it reduces to Bjorken x for large Q{(^xasQ^ ^f)i and the C 1 limit corresponds 
to the physical mass threshold {W — >■ Mj) — both key features of a rescaling variablejj 

b) for non-zero A, the rescaling effect is reduced as x moves toward smaller values, away 
from the threshold region where it is required; and 

c) for a given Q, x < C, < Xi i-e- C li^s in-between the ZM x (which violates non-zero mass 
kinematics) and the uniform rescaling variable x (which may be too restrictive). 

The rescaling factor C/^^ is shown in Fig.[T] 
for several values of A. The top horizontal line, 
A = 0, corresponds to uniform rescaling C = X- 
The base line corresponds to the ZM case C, = ^■ 
Those two lines are parallel, since x/^ is a con- 
stant factor, as already mentioned. The other 
curves correspond to A = 0.1, 0.2, and 1, re- 
spectively. The amount of suppression of phase 
space away from the physical mass threshold 
can be controlled in a smooth way by varying 
the parameter A. The small-x rescaling is re- 
duced for increasing values of A; and it has prac- 
tically vanished below x = 0.1 even for a mod- 
erate choice of A = l|^ The range of variation of 
the rescaling variable, represented by the width 
between the two horizontal lines, is Mj/Q"^; it 
shrinks rapidly for increasing values of Q. Thus, rescaling at high energy scales is negligible 
for any choice of A for all x. 

The rescaling variable ( is simple and flexible. It can be used effectively to explore how 
well the conventional ZM formulation can be improved to yield more reliable predictions. In 
the next section, we will show examples of global fits based on improved ZM calculations 
with this variable, with selected values of the parameter A. We will find that the generalized 
rescaling variable is the key requisite needed to correct the kinematical dependence of the 
ZM cross sections, while the other modifications (discussed in the next subsection) play a 
secondary role. 




Figure 1: The rescaling factor (/x vs. x 
for A = 0, 0.1, 0.2, and 1. 



^Specifically, one can show that = ^(1 + C^-jA^//Q^)C ^ ~ -'^)' neglecting ordinary hadron masses. 

^The limiting case C = a; is reached in Eq.® formally by taking A — > oo. But, as can be seen from 
the steep rise of the rescaling factor curves for larger values of A in Fig.[Tl the transformation as a; —> 1 is 
ill-defined in this limit. For all practical purposes, A « 1 already produces enough small- a; suppression of 
rescaling to produce results close to that using directly = x, while ensuring kinematic bounds. 



3.2 Correct summation over physical final states 

Next, we turn to the issue of summation over final-state partons, of Eq. ([T]) — 
point (ii) raised in Sec. El This is intimately related to the choice of partonic subprocesses 
included in the calculation for the given combination of kinematical parameters. Consider, 
in particular, production of a 6 quark in neutral-current DIS discussed in item (ii) of the 
previous section (x ~ 10"'', Q ~ 3 GeV, W ~ 300 GeV). Since bb pairs are physically 
produced in the final state, the gluon fusion subprocess ■y*g bb must be included in the 
sum of non-zero scattering contributions independently of the choice of the factorization 
scheme and scale. In the conventional implementation of the ZM scheme, with the scale 
fi = Q = 3 GeV being below the bottom mass, this subprocess is not included, resulting 
in the dilemma mentioned in Sec.[2l But, there is no reason, in principle, to tie the final- 
state parton summation to the initial-state one, Yla- -'^^ fact, in the fixed-fiavor-number 
scheme, as in the GM VFNS, the subprocesses '~f*g cc, bb are always present whenever 
there is enough CM energy to produce these heavy-flavor states. Therefore, we can try to 
adopt the same physically sensible approach in the ZM scheme, i.e. we shall include these 
subprocesses while keeping the ZM hard matrix elements. 

However, this seemingly straightforward "fix" of the conventional implementation of 
ZM is not entirely trivial. The ZM matrix element for the partonic subprocess 'j*g — > bb is 
obtained after a subtraction, 

'^b,xi^^) = C'fe,A( V^)unsubtracted " MS subtraction(l/£:, /i), (4) 

where the subtraction term represents the coUinear singularity due to the zero quark mass 
approximation. This term is sensitive to fi, and in general its fi dependence matches that of 
the PDF b{x, fi) in the LO 7*6 b term. If this LO term is absent because of the conventional 
scale choice fi = Q while Q < nib, there would be a mismatch; then the simple addition of 
the j*g — > bb subprocess, using Eq. (jl]), would be incorrect. However, this problem arises 
only because of the choice of the default scale fi = Q, not because of the ZM formalism 
itself. For heavy-flavor production, it is in any case natural to choose a factorization scale 
fi > mb, e.g. /i = \/Q'^ + ml. Then the corresponding LO will be present, and consistency 
is restored. Clearly, any choice of /i that stays above and approaches Q at high energies 
would be equally acceptable. The differences will be of higher order. We have verified that 
the results are not sensitive to the scale choice, hence will use the above as the default. 

There are also other ways of compensating for the MS subtraction to the gluon-fusion 
contribution if the LO processes 7*6 — > b and 7*6 — > b are absent. For instance, in the GM 
calculation, the term that removes the collinear singularity of the unsubtracted gluon fusion 
term is [4], 



as ( /i 



27r \m 



(5) 



where fg is the gluon distribution, Pgg is the g ^ q splitting function, and C^^^^ is the LO ZM 
matrix element for 7*6 — > b contribution to the structure function Fx- This logarithmic term 
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is equivalent to the subtraction term in Eq. (jl]), cf. [4]. Therefore, one can consider adding 
([5]) to the ZM matrix element ^{fj,) in order to maintain consistency (scale independence) |3 

The modified formulations of the ZM described in the above two subsections are designed 
to remedy the obvious problems of the conventional ZM in both the (large x, small W) and 
the (small x, small Q) regions, as identified in Seed) They illustrate the point that the 
simplicity of the ZM parton formalism can be retained without the manifest violation of 
physical requirements. The proposed modifications of the conventional implementation are 
not unique. The differences between the modified choices all vanish at energy scales much 
larger than m^; and they usually are one order higher in a^. 

These improvements are all motivated by, and adapted from, recent implementations 
of the more accurate GM formalism [6,9]. They belong to an intermediate stage between 
the ZM and the GM formalisms, hence will be designated as intermediate-mass variable 
flavor number schemes (IM VFNS, or IM scheme). They can be viewed either as improved 
ZM calculations with physical treatment of final states inspired by GM, or equivalently, as 
simplified GM calculations with suitably defined ZM hard cross sections. 

3.3 Comparison of calculational schemes with heavy-quark data 

To carry out a comparative study, we have implemented the conventional ZM and the var- 
ious formulations of the IM scheme described above in our CTEQ global analysis program, 
alongside with the existing implementation of the GM scheme that by now has become the 
standard. 

As a first step, we compare the heavy-quark calculational schemes by evaluating inclusive 
DIS cross sections with heavy quarks in the final state, using known PDFs. These cross 
sections are most directly affected by the heavy-quark mass treatment, hence give us a clear 
measure of the significance of the various calculational schemes. The data sets consist of 
charm production from HI [21-23], ZEUS [24,25], CCFR and NuTeV [26]; and bottom 
production from HI [22,23]. The total number of data points is 222. 

For this initial comparison, the cross sections are calculated in the conventional ZM 
formulation, the GM formulation, and the IM formulation of Sec. 13.21 For each of these 
calculations, we first evaluate the cross sections using the PDFs of the latest GM global fit 
CTEQ6.6M [7] (abbreviated as CT66). These numbers are useful as references for assessing 
the results. The other PDF set used for comparison is CTEQ6.1M [11] (abbreviated as 
CT61), which represents the latest CTEQ ZM PDF set (2003)@ The values for each 
combination of the calculational scheme and PDF set serve as a measure of the goodness-of- 

2 

Since /fc ~ lii(^)[-Pgg ® fg] in the fi ^ mi, region, this term is essentiahy the same as the (absent) LO 

term [C^^°^ (8) fb]- Thus, the underlying physics is similar to that of the default of the previous paragraph, 
and the effect on the results turns out to be similar, too, cf. Sec. 14.21 

®The original CT61 analysis did not include the latest data sets for semi-inclusive DIS heavy-quark 
production. The effect of including the newest heavy-quark data on the ZM PDFs will be described in the 
comparisons presented below. 
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fit in the comparison of tlieory witli data. Tliey are presented grapfiically in Fig.[2l In tlie 
two IM calculations, we adjust tlie parameter A in tlie rescaling variable ( of Eq. ([3]) so as 
to obtain the lowest possible with the heavy-quark data for each examined PDF set. 

* The GM calculation using CT66 (leftmost bar) agrees perfectly with the experimental 
data — of 186 for 222 data points. The dashed line, corresponding to x^/point = 1, is 
drawn for reference. 

* The ZM calculation using CT66 PDFs (sec- 
ond to rightmost bar) fails dramatically com- 
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ZM 
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Figure 2: The log-likelihood fo^' 222 
data points from neutral- and charged- 
current heavy quark semi- inclusive DIS, 
obtained in the GM, IM, and ZM schemes. 



pared to data by comparison. Since the CT66 
PDFs, as far as we know, are the closest to the 
true PDFs, the ZM-CT66 column gives a rea- 
sonable measure of the size of the error of the 
ZM calculation in the conventional implementa- 
tion — ~ 1200/222 points — a very large dis- 
crepancy. The same point is illustrated by the 
GM-CT61 column (second leftmost). In this 
case, since the GM scheme comes closest to the 
correct calculation, the discrepancy provides an 
approximate measure of how the CT61 PDFs 
are distorted to compensate for the imperfect 
calculation used in the original ZM fit. 

* The value is the largest in the case of the ZM-CT61 calculation. This discrepancy is 
partially accounted for by the fact that the heavy-quark data did not exist at the time of 
the CT61 global analysis. This number comes down to = 406 if the recent heavy-quark 
data are included (by performing an updated ZM analysis), as shown by the darker- colored 
part of the ZM-CT61 bar. But the updated CT61 PDFs still do not achieve the level of 
agreement with the data observed in the GM-CT66 calculation, as can be observed from the 
comparison of the leftmost (GM-CT66) and rightmost (ZM-CT61) bars in the figure. 

* The impact of the IM scheme is illustrated by the IM-CT66 calculation. We chose A = 0.05 
in the rescaling variable ( so as to attain the lowest x"^ with the examined heavy-quark data 
sample. The IM scheme dramatically reduced the ZM-CT66 mismatch, with decreased 
from 1240 to 337. Corrective measures adopted in the IM formulation, addressing manifest 
flaws of the conventional ZM treatment, do have a decisive, positive effect. The IM-CT61 
fit similarly performs better than ZM-CT61, resulting in the reduction of x^ to 347, reached 
for A = 0.220 

These results provide an incentive to pursue a more detailed study of the IM formulation, 
including a new global fit of PDF's in this formulation. Since these calculations retain the 



^The original ACOT rescaling variable x (defined in Eq. ^ and equivalent to the variable C with A = 0) 
produces worse IM fits, with the log-likelihood = 354 in the CT66 case and 650 in the CT61 case. 
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zero-mass matrix elements, they are still only approximations to the GM scheme. What 
we have seen here is that, in practice, the numerical effects due to the final-state counting 
and phase space treatment (that have been corrected) are larger than those due to the 
approximate hard matrix elements (that remain). Thus, it is possible that the core feature 
of the ZM scheme — its simple hard matrix elements — can be preserved in a viable global 
analysis. 



4 Global Analyses 

We now turn to the comparison of global analyses performed using the various schemes. The 
goal is to quantitatively study how the proposed IM schemes bridge the gap between the GM 
scheme (CT66) and the conventional ZM (CT61 and its updated equivalent). We will explore 
in detail the efficacy of several realizations of IM scheme described in Sec.[3l by comparing 
both the parton distributions and the physical predictions of complete global analyses based 
on these realizations with those of the conventional ZM and the GM formulations. 



4.1 Comparison of global fits in the ZM, IM and GM schemes 



Conventional ZM: 3060 



o 

'f^ 2800 4 



CT66: 2760 



Zacoi rescaling; 2973 



Best IM fit: 2791 



We found that among the possibilities ex- 
plored, the choice of the rescaling variable (Sec. l3.ip 
has the largest impact. We present here re- 
sults from four IM global fits, for A = 0.3, 0.15, 
0.05, and in the rescaling variable ( defined in 
Eq. ([3]). These fits are identified as IMa, 1Mb, 
IMc, and IM^. The conventional ZM fit (de- 
noted as ZMO) is close numerically to all IM 
fits with A > 1. All these fits use the same ex- 
perimental input data and parametrizations of 
the input nonperturbative PDFs as in the ref- 
erence CT66 fit (denoted by GM) [7], in order 
to allow meaningful comparisons. Figure 3: The global values obtained in 

The goodness-of-fit values in these global the GM, IM, and ZMO schemes discussed 
fits are shown in Fig.[3l For a total of 2716 data jj^ the text, 
points, the best fit is given by the reference GM 

fit CT66 (x^ = 2760). This is physically significant — the GM QCD formalism is in better 
agreement with a wide variety of global data than all the approximate ZM and IM formula- 
tions. The quality of the IM fits shows sensitive dependence on the value of the A parameter 
that controls the rescaling behavior. The x^'s of the two worst fits, ZMC0 and IMx at the 



IM-/ 



i°The ZMO fit is similar, but not identical, to the earlier CTEQ6.1 fit [11], cf. footnote El The results of 
this ZM fit were used in obtaining the "updated CT61" column of Fig.O 
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upper and lower ends of this group, are ~ 300 higher than in the reference fit (for 2716 
data points) — beyond the acceptable range of fits characterized by the 90% C.L. criterion of 
CTEQ analyses (~ Ax^ < 100 [6,7,10,11]). In between these two special cases, the quality 
of the global fit spans a continuous range, with the best fits to the global data occurring 
for the A parameter around 0.15. The 1Mb fit (A = 0.15) has an overall of 2791 — only 
31 above that of the CT66 fit. It provides reasonable description of the heavy-quark SIDIS 
data examined in Section [3l3l with = 212 for 222 data points. By these criteria, it is well 
within the usual range of acceptable fits. 

All the new IM global fits shown in Fig. [3] use the default prescription for treating 
heavy-flavor final states described in Sec. 13.21 As pointed out there, one can alternatively 
choose either a different scale (other than the default /i = a/Q^ + '^^), or a different way 
of compensating the MS subtraction in the ZM gluon-fusion matrix element. We have 
investigated the effects due to both of these options on the global fit. They turn out to be 
relatively small compared to those of varying A described above: the overall of the global 
fit does not change by more than ~ 15 for all reasonable alternative choices in either cases 
(for any given A). 

Since the IM calculations still retain the ZM hard matrix elements, the IM PDFs will 
involve some adjustments made to compensate for this approximation. We shall try to 
quantify the deviations of the PDFs of the these global fits from those of the GM reference 
fit, as well as the possible differences in their predictions in the next section. (We shall find 
both to be surprisingly small for the best IM fits.) 

4.2 Comparison of parton distributions 

Poor values in the bad fits (ZMO and IMx) from the previous section are caused, for 
the most part, by disagreements with the high-precision HERA data points at small x and 
small Q. This is expected, in view of the problems of the ZM scheme and the conventional 
rescaling variable x small x discussed in Sections[2] and [31 To gain more insight on this, 
we show in Figs.H] and the w-quark and the gluon distributions of the ZMO/IMb/IMx 
analyses, normalized to those of the standard fit CT66M, at two scales, Q = 2 GeV and 85 
GeV. The (i-quark distribution behaves in a similar way as for the w-quark (only with larger 
error bands), hence is not separately shown. The radiatively generated charm and bottom 
PDFs (not shown either) behave similarly to the gluon PDFs. To provide a useful reference 
for gauging the size of the deviations, we also show the uncertainty bands of the CT66 PDFs 
that result from the propagation of the input experimental uncertainties. 

We see that the ZMO and IMx PDFs lie on opposite sides of the CT66 PDFs; and the 
PDFs of the best IM fit, 1Mb, lie in between. The quark distributions of the ZMO set are 
suppressed compared to those of CT66, particularly at small x. This is because, with no 
threshold mass suppression or rescaling in the ZMO fit, the magnitude of the heavy-quark 
scattering contributions to inclusive DIS structure functions is overestimated (especially at 
small x), resulting in suppressed light-fiavor PDFs in order to compensate for excessive 
heavy-quark scattering in the global fit [6,7]. On the other hand, x rescaling introduces too 
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u at Q=2. GeV 



g at Q=2. GeV 
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Figure 4: The x dependence of the w-quark (left) and gluon (right) PDFs at Q = 2 GeV, 
plotted as ratios to the CTEQ6.6M w-quark and gluon PDFs. The blue band is the CTEQ6.6 
PDF uncertainty. The lines correspond to 1Mb (black solid), ZMO (red long-dashed), and 
IMx (green short-dashed) PDFs. 




s at Q=2. GeV 
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Figure 6: Same as in Fig. HI for the strange quark PDF at Q = 2 GeV. 

much suppression of phase space in heavy-quark DIS contributions at small x (cf. Sec. l3.ip . 
resulting in overcompensated light-flavor PDFs. The gluon distribution shows similar sup- 
pression/enhancement as the quark distribution at small x, but this behavior reverses itself 
at moderate values of x — mostly as a consequence of the momentum sum rule constraint. 

For both ZMO and IMx fits, the deviations of the light quark distributions from their 
CTEQ66 counterparts at small x are at the boundary of the PDF uncertainty bands; whereas 
those of the c, and h distributions stay generally within the bands. With the rescaling effect 
operative at large x (where it is well motivated), but suppressed at small x (where it is not 
necessarily needed on physical ground), the intermediate fit, 1Mb, yields the PDFs in-between 
the two extreme cases. The u and d (anti)quark PDFs in the IM scheme are consequently 
closer to the CT66 reference PDFs (surprisingly so for the gluon). The rescaling variable C, 
adopted in Section [3TT] is thus capable of bringing both the global value and shapes of the 
majority of the PDFs in the IM analysis close to those in the full GM scheme. 

The strange-quark PDFs s(x, Q), shown in Fig. [6l look more different than the other 
flavors. In global analysis, s{x,Q) is not as well constrained in general because it is only 
sensitive to data on charm production in neutrino scattering, which is not very precise. The 
IMx PDF is in better agreement with the CTEQ6.6 PDF than the 1Mb one, suggesting 
that the charged- current cross sections prefer a smaller value of A (i.e., closer to ACOT x) 
than the overall value of 0.15 for the best fit. (The rescaling variable C, has different mass 
dependence in the two cases.) 
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o-(NLO) in units of o-(CTEQ66M) 
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ff(171) 
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Figure 7: Representative NLO cross sections at the Tevatron and LHC, normalized to the 
CTEQ6.6M cross sections. 

4.3 Comparison of Physical Predictions 

To see how well the IM PDFs do in high-energy physics phenomenology, we have compared 
theoretical total cross sections for a number of representative scattering processes at the 
Tevatron and the LHC using the new PDF sets described above. In this calculation, we 
adopted the procedure of the analogous comparison in the CTEQ6.6 paper [7]. To probe 
typical combinations of parton distributions and x values, we show cross sections for pro- 
duction of and bosons; top quark-antiquark pairs (for top quark mass rrit = 171 
GeV); Standard Model Higgs bosons with mass 120 GeV in gg fusion [27, and references 
therein]; supersymmetric neutral CP-odd Higgs bosons with masses 120 and 200 GeV 
in bb annihilation [28]; and supersymmetric charged Higgs bosons in cs scattering [29]. 
Details of these calculations are provided in Section 4 of Ref. [7]. The cross sections are 
evaluated at the next-to-leading-order in the QCD coupling strength using the program 
WTTOT [30]. 

Fig. [7] compares the IM and CTEQ6.6 total cross sections. The central predictions 
(denoted by boxes) correspond to the best-fit CTEQ6.6 and 1Mb PDF sets. Filled red boxes 
and error bars denote the CTEQ6.6 predictions and the PDF errors due to the propagation 
of experimental uncertainties. Empty blue boxes are for the best-fit IM fit with A = 0.15; 
the dashed lines represent the range of IM predictions for the rescaling parameter A in the 
range < A < 0.5. We see that the 1Mb central predictions, and the IM ranges, mostly lie 
within the CTEQ6.6 PDF uncertainty bands. The IM calculational scheme is thus able to 
reproduce general features of a variety of CM cross sections. The only exception seen, for 
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(a) (b) 

Figure 8: NLO and Z production cross sections at the LHC, for CTEQ6.6M and the 
respective PDF uncertainty (red triangle and dashed elhpse); for best-fit ZMO, IMa, 1Mb, 
IMc, and IMx PDFs (left subfigure, filled blue boxes); for GM fits with A = 0.05, 0.15, 0.3 
(right subfigure, empty red triangles). 

cs — > at the LHC, is due to the strange quark as mentioned at the end of Section 14.21 

The IM results for W and Z production cross sections at the LHC are shown in the 
two-dimensional plot of Fig.[8](a). The ellipse in the plot corresponds to the CTEQ6.6 PDF 
uncertainty range, similar to the bands shown in the previous figure. We see that the ZMO 
and IMx predictions are far away from the CTEQ66M one, in opposite directions; while the 
intermediate ones are close to the boundary of the uncertainty ellipse. Not surprisingly, the 
1Mb point is very close to the reference CT66 one. These results mirror closely the range of 
variation of the M-quark distribution at ~ 10~^ in Fig. [5] (with similar results for the d quark 
that is not shown), since the W/Z cross sections are dominated by the quark-antiquark 
annihilation process with the kinematic variable range x ~ 10^^ and Q ~ 80—90 GeV. 
In particular, we see that the 1Mb quark line in Fig.Et is extremely close to CTEQ6.6M 
(horizontal line) in this range. 

The 1Mb prediction in Fig. [8](a) lies in the lower part of the CTEQ6.6 error ellipse, 
suggesting that the 1Mb ratio of the Z and W total cross section, crtot{Z) / atot{W) , is lower 
than in the CTEQ6.6M case. This is again consistent with the observation that the 1Mb 
strange PDF s{x, Q) is lower than the CTEQ6.6 one in the range of x ~ 10~^ typical for 
W and Z production at the LHC. As noticed in Ref. [7], the ratio (Ttot{Z) / atotiW) ~ 0.1 at 
the LHC, despite being well-constrained, is quite sensitive to the form of s{x, Q). A smaller 
magnitude of s(x, Q) thus results in a smaller CtotiZ) / atotiW) ratio. 
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4.4 Generalized rescaling variable in the GM scheme 

The IM scheme results discussed above are sensitive to the choice of the parameter A in 
the definition of ( — an indication of its phenomenological origin. The choice of the scahng 
variable, involving powers of m/Q, is inherently an issue beyond the usual perturbative 
formalism. In fact, this ambiguity exists also in the GM scheme. 

In principle, we could apply the proposed generalized rescaling variable ( to the GM 
analysis (in place of the default ACOT-x) as well. Will the GM formalism be stable with 
respect to variations of the A parameter in ^? We expect the answer to be "yes", since, as a 
PQCD theory including heavy-quark masses as the basic parameters of the Lagrangian, the 
GM formalism contains built-in compensation between the ( dependence of the contributions 
from heavy-quark-initiated subprocesses and corresponding subtraction terms applied to 
gluon-fusion subprocesses at each order of as- 

We have verified that this expectation indeed holds in practice. Specifically, in contrast 
to the large difference in the overall of about 180 between the global fits IMx (A = 0) 
and 1Mb (A = 0.15), the difference between the values of two GM fits for these same A 
values is only 12. The CTEQ6.6M PDF, corresponding to A = 0, has a slightly better 
than the GM fits with non-zero A values, suggesting that the ACOT variable x (motivated 
by the exact kinematics of the heavy-quark pair production in the gluon fusion process) does 
the best job of all in the GM case. The GM PDFs for A as high as 0.4 remain within the 
band of the CTEQ6.6 PDF uncertainty, while the IM fit with A = 0.4 is strongly disfavored. 
Predictions for the LHC W and Z cross sections based on GM fits with A < 0.3 lie within 
the CTEQ6.6 PDF error ellipse, as illustrated by Fig. MJo)- Sensitivity of GM cross sections 
to A is clearly reduced in comparison to the IM calculation. 

5 Concluding Remarks 

For many years, the ZM variable-fiavor number factorization scheme has given a high-quality 
description of existing global hard-scattering data and provided predictions for a wide range 
of high-energy processes. Even though by now the GM scheme has superseded the ZM scheme 
as the more precise formalism, the ZM scheme still has a lot of appeal for practical reasons. It 
continues to be use in most phenomenological calculations. For this reason, we carry out the 
present study to lay out more explicitly the approximations and inconsistencies inherent in 
the conventional implementation of the ZM scheme. We show how these inconsistencies, due 
to the ad hoc imposition of the heavy-quark mass thresholds on the zero-mass QCD theory, 
can be corrected by a more careful physical treatment of the heavy-fiavor final states, while 
preserving the simplicity of the ZM hard matrix elements. Our proposed intermediate-mass 
(IM) calculational scheme can be considered either as improved ZM formulations with GM 
kinematics of final states, or simplified GM formulations with ZM hard matrix elements. 

The key element that makes the IM scheme useful is the introduction of a fiexible rescal- 
ing variable ( that generalizes the mass-dependent rescaling variable x [20] that has already 
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been adopted by the latest global analyses in the GM scheme. The ( variable effectively im- 
plements kinematic mass threshold constraints in heavy-quark production processes, while 
minimizing unintended effects away from the physical threshold region in a smooth, con- 
trolled way by a parameter A, cf. Sec. l3.1[ This allows us to systematically investigate how 
the conventional ZM scheme can be improved while keeping the simple, well-known, ZM 
matrix elements. 

We demonstrated that global analysis carried out in the IM scheme can approximate 
the GM scheme results quite well, both in terms of the resulting PDFs, and in terms of 
typical physics predictions at the Tevatron and the LHC. The IM scheme can play a useful 
role in bringing the existing NLO analyses based on ZM hard matrix elements closer to 
the GM formulation, even without the full implementation of heavy-quark mass effects. 
Dependence of the IM predictions on the form of the effective rescaling variable underlines 
the phenomenological nature of this approach. Although this dependence in principle also 
arises in the GM formalism, we have demonstrated that it is less pronounced than in the 
phenomenological IM formulation. Thus, this additional source of theoretical uncertainty 
hardly affects what we know about the GM formalism - except that, perhaps, it should be 
added to the other sources of theoretical errors, such as scale dependence, when assessing 
the uncertainty of the GM theoretical results. 
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